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Introduction 


This  report  provides  a  detailed  description  of  my  accomplishments  for 
the  second  year  of  funding  supported  by  the  USAMRMC  Breast  Cancer 
Research  Program.  The  subject  of  my  research  is  to  identify  and  clone 
target  genes  for  the  transcription  factor  Egr-1.  The  purpose  is  to 
gain  an  understanding  of  the  profile  of  genetic  targets  for  Egr-1  in 
normal  breast  cells  which  are  absent  from  breast  cancer  cells.  The 
long  term  goal  is  to  build  transcriptional  profiles  for  Egr-1  and  to 
identifying  key  transcriptional  defects  occurring  in  breast  cancer  cells. 
For  this  report,  the  scope  of  the  research  is  to  first  confirm  the 
expression  of  Egr-1  in  normal,  but  not  in  breast  cancer  cells. 
Subsequently,  Egr-1  must  be  crosslinked  to  its  target  sites  in  vivo 
through  the  action  of  a  buffered  formaldehyde  solution,  and  the 
crosslinked  Egr-1,  together  with  its  bound  DNA,  isolated  for  further 
characterization. 

Body 

The  results  highlighted  from  the  first  year  documented  the  successful 
use  of  the  in  vivo  crosslinking  technique  to  recover  DNA  targets 
directly  bound  by  a  transcription  factor,  in  this  case  Egr-1. 

Additionally,  the  application  of  multiplex  PCR  allowed  the  amplification 
of  cDNA  sequences  permitting  target  gene  identification. 

In  this  report,  I  present  data  describing  my  progress  in  the  complex 
task  of  identifying  target  genes  through  which  key  Egr-1  regulatory 
effects  result  in  the  maintenance  of  normal  breast  cell  function. 

After  successfully  multiplex  amplification,  to  achieve  completion  of  the 
second  phase  of  the  project  it  was  critical  to  determine  whether  the 
protocol  faithfully  accomplished  the  amplification  of  genuine  Egr-1 
target  genes.  To  address  this,  I  proceeded  to  first  isolate  individual 
cDNA’s  to  allow  their  characterization  and  sequence  identification. 
During  this  process,  I  have  sequenced  a  full  length  cDNA  for  which  no 
homolog  or  ortholog  is  known,  by  genomic  database  searching.  I  have 
named  this  new  gene  TEX1  for  Target  of  Egr-1  by  (X)  crosslinking. 
Therefore,  to  identify  this  new  gene  as  a  target  for  Egr-1 
transcriptional  activity,  it  was  necessary  to  perform  an  in  depth 
characterization  of  the  gene.  As  shown  in  figure  1,  a  full  length  cDNA 
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was  isolated  and  sequenced.  While  homology  searches  of  DNA 
databases  revealed  no  match  to  the  cDNA,  I  found  that  the  human 
genome  sequencing  project  was  able  to  map  the  gene  to  human 
chromosome  1.  In  order  to  characterize  and  confirm  the  regulation  of 
this  new  gene  by  Egr-1,  I  performed  both  RT-PCR  and  gel  shift 
experiments.  As  shown  in  figure  2,  following  transfection  of  Egr-1  into 
H4  cells  (which  do  not  express  endogenous  Egr-1),  a  dose  dependent 
induction  of  the  expression  of  TEX1  was  noted.  This  result  indicated 
that  Egr-1  was  indeed  able  to  regulate  the  expression  of  TEX1  by 
activating  its  transcription. 


ATG  GCCG  CCGACAGTGACG  ATGG  CG  CAGTTT  CA  GCTCCCGCAG  CTTCCG  A 
M  A  ADSDDG  AVSAPAASD 
CGGTG  GT  GT  CAGCAAAAGCACAACATCTG  GG  GAGG  AG  CT  AG  TAGTCCAGG 
GGVSKS  TTSGE  ELV  VQ 
TTCCCGT  AG  TG  GATGTG  CAAAGCAACAACTT  CA  AG  GAG  ATG  TGGCCATCCC 
VPV  V  DV  QSNNFK  EMWPS 

TCCTG  CT  AG  CC  AT  AA  AG  ACAG  CT  AATTTCGT  GG  CT  GT  GG  ACACGG  AG  CT  GA 
LLLAIKTANFVAVDTEL 
G  TG  GG  CT  TG  GG  G  A  CA  GG  AA  GA  GT  TT  GCTG  AA  (X  AG  TG  CATT  G  A  GG  AA  CG  TT 
SG  LG  DRKSLLNQCI  E  ER 
A  CA  AG  GC  CG  TG  TG  TO  AT  GCTG  CC  AG  G  A  CC  CG  TT  CT  AT  CCTT  TC  CCTG  G  G  CC 
YK  AVCHAART  RS  I  LSL  G 
T  CG  CCTG  CTTCAAGCGG  CAGCCAGA  CA  AG  GG  TG  AACATT  CCTATCTG  GCTC 

lacfk  r  qpdk  gehs  yla 

AAG  TG  TT  CA  AT  CT  CACTCT  GCTG  TGCATG  GAGG  AG  TATG  TCAT  AG  AACCAAA 
QV  FNLTLL  CMEE  YVI  E  PK 
G  TCTG  TG  CA  GT  TCCT  G  A  TACA  GC  AT  GG  CT  TC  AA  CT  TC  AA  CC  AG  CA  GT  AT  GCC 
SVQFL  I  QHGFNFNQQYA 
CAAG  G  CATCCCCTACCATAAG  G  GCA  AT  GACAAGG  GTG  ATG  AG  AG  CCAGAG 
Q  G  I  EYHKGNDKGDESQS 
CCAGTCAGT  ACG  GACCCTATT  CCT  G  G  AGCT  AATCCG  AGCCCG  CCG  G  CCCCT 
QSVR  TLFL  EL  I  R  ARR  EL 
GGTGCTACACAAT  GGCCTT  AT  AG  ACTT  GGTG  TT  CCTG  TACCAG  AACTTCTAT 
VLHNGLI  DL  VFLYQNFY 
G  CACACCTCCCTG  AG  AG  TCTG  GG  AACCTT  CACCGCTG  ACCT  GT  GT  GAGATG 
ahlpe  sl  gtft  ADLCEM 
TTCCCAG  CAGG  CATT  TATG  ACACCAAATATG  CT  GCTG  AG  TT  TCAT  GCCCGTT 
F  PAG  IYDT  KYAAEFHAR 
TCGTG  GCCT  CCTACTTAGAAT  ATGCCTTCCGGAAATGTG  AACG  GG  AAAATG 
FV  ASYLEYAF  RKCER  EN 
GGAAG  CAGCGG  GCAG  CT  GG  CAGCCCACACCTTACCCT  GG  AG  TT  CT  GCAAC 
GK  QR  AAGS  PHLTL  EFCN 
T  AT  CCTT  CCAG  CATG  AG  GG  ACCATATT  GATT  ACCG  CT  GCTG  CCTG  CCCCCA 
YPSSMRDHI  DY  RCCL  PP 
GCAACCC  ACCG  TCCT  CATCCCACCAGCAT  CT  GT  GACA  ACTT  CT  CG  GCTT  AT 
ATHR  PHPT  SI  CDNFS  AY 
G  GCT  GG  T  G  CCC  CCT  GG  G  A  CC  A  CA  G  TG  T  C  CT  C  AG  T  CT  C  A  CG  A  TAT  TG  A  CCT  T 
GWC  PL  GPQC  PQSHDI  DL 
ATCATT  GACACTG  ATG  AG  GCTGCGG  CAG  AG  G  ACAAG  CG  GCG  ACGG  CGACG 
I  I  DTDE  AA  AE  DKR  RR  RR 
ACGTAGGGAAAAACGGAAG  AG  GG  CTTT  ATTG  AACCTACCGG  GG  ACACAG  AC 
RR  EKR  KR  ALLNLP  GTQC 
CTCTGGGG  AAG  CT  AAG  GATGG  TCCT  CCC  AAG  AA  GCA  GG  TCTGTGG  G  GATA 
SG  EAKDGPPKKQVCG  D 
G  CATC  A  AG  CCT  GAAG  AAACCG  AG  CAG  GAGG  T  GG  CT  G  CCG  AT  GAAACTAGG 
S  I  KPEET  EQE  V  AADETR 
A  ACCT  GCCTCACT  CCAAGCAAGGCAACAAAAAT  GACTTAGAGATG  GG  GATT 
NLPHSKQGNKNDLEMG  i 
AAG  GCAG  CAAG  GCCT  GA  AATAGCTG  AT  AG  AG  CT  ACCT  CAG  A  AG  TG  CCAG  G 
K  A  A  R  PEI  AERATSEV  PG 
G  AG  CCAA  GCCAGT  CCTA  ACCCAG  TG  CCTG  GG  GG  TG  GATT  GCACCG  GG  CT  G 
S  QASPNPV  PG  GGLHR  A 
GTTTT  GATG  CCTT  TATG  ACAG  GTTATG  TG  AT  GG  CCTATG  TG  GAAG  TG  AG  CCA 
GF  DAFMTGYVMAYVE  VSQ 
GG  GACCG  CAACCCT  G  CAGCT  CT  GG  ACCCTG  G  CTCCCT  GAAT  GCCACAATAA 
GPQPCS  SGPWLPECHNK 
GGT  AT  ATTTG  AGT  GG  CAAAG  CTG  TACCCCT  CACAGTGG  CCAAG  AGCCAGTT 
VYLSGKAVPLT  VAK  SQF 
CTCT  CG  T  T  CCT  CCA  AA  G  CCC  A  CA  ATC  A  G  AA  G  AT  G  AAG  CTC  A  CT  T  GG  G  G  CA  G 
SRSSKAHNQKMK  LTWGS 
TAG  CTG  A 
S  stop 

Figure  1.  cDNA  and  protein  sequence 
of  TEX1;  a  new  Egr-1  target  gene. 
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Figure  2.  RT-PCR  showing 
activation  of  TEX1  expression 
following  Egr-1  transfection 
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Figure  3.  Gel  shift  analysis  showing 
specific  binding  of  Egr-1  to  the  TEX1 
5'  non-coding  sequence. 


Figure  4.  TEX1  inhibits  cell  growth. 

Growth  curves  for  empty  vector  (pCEP4), 
an  unrelated  protein  (calbindin),  or  cells 
expressing  TEX1  (TEX1),  are  shown. 


Further  confirmation  that  TEX1  represents  a  direct  target  of  Egr-1 
was  shown  using  the  gel  shift  assay.  As  seen  in  figure  3,  recombinant 
Egr-1  displayed  direct  and  specific  binding  to  the  5’  non-coding  portion 
of  the  TEX1  gene.  Further  characterization  of  TEX1  has  included 
studies  examining  the  biological  activity  of  this  newly  found  protein.  I 
have  found  that  following  transfection  of  a  TEX1  expression  vector, 
cell  growth  is  dramatically  inhibited  (figure  4).  This  result  is  currently 
being  further  characterized  to  identify  the  precise  mechanism  by  with 
growth  inhibition  is  achieved.  Clearly,  the  cloning  of  this  new  gene  has 
potential  value  to  contribute  a  greater  understanding  of  how  the  loss 
of  Egr-1  in  breast  cancer  cells  contributes  to  their  uncontrolled 
growth.  Together,  these  results  confirm  not  only  the  identification  of 
an  Egr-1  target  gene,  but  have  provided  evidence  that  the  crosslinking 
technology  is  capable  of  new  gene  identification.  Having  demonstrated 
the  validity  of  the  technology  I  am  continuing  with  my  planned 
statement  of  work  by  screening  the  captured  and  amplified  cDNAs.  To 
accomplish  this  task  in  the  most  efficient  manner,  I  have  applied  the 
technology  of  DNA  microarray  screening  to  allow  simultaneous 
identification  of  the  entire  profile  of  Egr-1  targets.  A  facility  here  at 
the  Burnham  Institute  is  in  the  final  stages  of  set  up,  and  I  have  been 
allowed  to  test  the  in  house  produced  microarrays  for  my  project.  To 
date  my  profile  of  multiplex  target  have  been  analyzed  three  times, 
and  I  am  in  the  process  of  repeating  the  entire  experimental  protocol 
to  ensure  reproducibility.  This  is  necessary  given  the  variation  from 
array  to  array  that  currently  exists.  I  anticipate  that  the  results  from 
the  repeat  experiments  will  be  concluded  shortly,  following  expansion 
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of  the  Institute’s  faclity.  This  will  lead  to  the  compilation  of  consensus 
Egr-1  targets  in  breast  cells  and  be  followed  by  their  analysis  as 
detailed  in  the  third  phase  of  this  project. 


Appendix 

1)  Key  research  accomplishments: 

*  Use  of  multiplex  PCR  amplification  to  identify  a  new  gene  target  for 
Egr-1,  named  TEX1 . 

*  Characterization  of  TEX1  as  a  genuine  direct  target  for  Egr-1, 
validating  the  technology. 

*  Identification  of  TEX1  as  an  inhibitor  of  cell  growth. 

*  Application  of  DNA  microarray  screening  to  profile  captured  Egr-1 
targets  simultaneously. 


2)  Reportable  outcomes: 

Manuscript:  I.  de  Belle,  J.-X.  Wu,  S.  Sperandio,  D.  Mercola  and  E.D. 
Adamson  (2001).  In  vivo  cloning  and  characterization  of  a  new  growth 
suppressor  protein  TEX1:  a  direct  target  gene  of  Egr-1  (manuscript  in 
preparation). 

Funding  received:  University  of  California,  Tobacco  related 
diseases  research  program.  New  Investigator  Award  9KT-0078.  Start 
date,  July  2000. 
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